Language Modeling Based Local Set Reranking using Manual Relevance Feedback
نویسندگان
چکیده
We present a novel approach to re-ranking documents using language modeling (LM) and manual relevance feedback (RF). The documents returned by an initial search algorithm, called the Local Set, is reranked based on manual relevance feedback using a ranking function modified to perform at the local set level. Instead of using the query independent collection model, which is too general, we use the query-specific local set, to model the background distribution. The resultant relevance model learns a more specific set of terms relevant to the query. We achieve better ranking performance than existing approaches that employ both LM and RF. We are guided by efficiency considerations and the need of new search paradigms like personalization, that require re-ranking of initial search results based on various criteria rather than launching a fresh search into the entire corpus.
منابع مشابه
Visual Resampling for Pseudo-Relevance Feedback during Speech-based Video Retrieval
A method is proposed that makes use of visual reranking to selectively sample feedback sets for Pseudo-Relevance-Feedback during speechtranscript-based video retrieval. Observed performance improvement is indicative of the ability of visual reranking to increase the relevance density of the feedback set.
متن کاملDublin City University at CLEF 2006: Robust Cross Language Track
The main focus of the DCU group’s participation in the CLEF 2006 Robust Track track was to explore a new method of reranking a retrieved document set based on the initial query with a pseudo relevance feedback (PRF) query expansion method. The aim of re-ranking using the initial query is to force the retrieved assumed relevant set to mimic the initial query more closely while not removing the b...
متن کاملMining and Ranking Biomedical Synonym Candidates from Wikipedia
Biomedical synonyms are important resources for Natural Language Processing in Biomedical domain. Existing synonym resources (e.g., the UMLS) are not complete. Manual efforts for expanding and enriching these resources are prohibitively expensive. We therefore develop and evaluate approaches for automated synonym extraction from Wikipedia. Using the inter-wiki links, we extracted the candidate ...
متن کاملUnsupervised training methods for discriminative language modeling
Discriminative language modeling (DLM) aims to choose the most accurate word sequence by reranking the alternatives output by the automatic speech recognizer (ASR). The conventional (supervised) way of training a DLM requires a large amount of acoustic recordings together with their manual reference transcriptions. These transcriptions are used to determine the target ranks of the ASR outputs, ...
متن کاملDCU@FIRE2010: Term Conflation, Blind Relevance Feedback, and Cross-Language IR with Manual and Automatic Query Translation
For the first participation of Dublin City University (DCU) in the FIRE 2010 evaluation campaign, information retrieval (IR) experiments on English, Bengali, Hindi, and Marathi documents were performed to investigate term conflation (different stemming approaches and indexing word prefixes), blind relevance feedback, and manual and automatic query translation. The experiments are based on BM25 ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009